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Abstract: 



The multimedia interactive conferencing application (MICA), a 
personal-workstation application for multipoint visual teleconferencing, is 
described. MICA allows people at two or more locations to share visual 
niaterial such as documents, photographs, and computer screens in a 
highly interactive way. It supports the distribution, storage, retrieval, and 
high-quality display of visuals, real-time interaction by pointing and 
aimotation, and meeting services facilities. The context of multimedia 
teleconferencing and computer-supported cooperative work is 
established, relating earlier research to the design of MICA. The services 
MICA offers are outlined. The handling, compression, and display of 
multiple media, and the design of a suitable user interface for MICA are 
discussed. 
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Ahstract^^he mullimcdU intcnctive confenoclng appUcatloD 
(MICA) Is a personal-workstation appOcatioD for muItfpoiBt 
viaoal tdccoflfoucing. It aUews people mt two or mm locadoiu 
to share visual material such as docnments, photographs, and 
computer screeats, in s highly interactive way. It supports the 
distribntlon, storage, retrieve aad b^H|DBllty display off visuals, 
real-time faiteractfaui by pointing and annotation, and meeting 
servtoes fecWtles. in this paper we esUbttsh the context of 
■iDltlmedla tefeconferesctag and computer sopported cooperative 
wort» ralatiBg earlier researrfo to our dcslga of MICA. We ouUlne 
the services MICA ofCers, and then focas oo two of the major 
technical challenges: the handling, compression and display of 
multiple media, aad the design of a s^table user mterfaoe. A third 
m^or area, the mnltipoint servkse Hial supports the appUcation. 
la detailed In a oompanloa paper. 

I. iNTRODIJCnON 

A RECENT trend in multimedia image handling and dis- 
play has been the move from dedicated systems with 
special-purpose hardware towards ^plications that run on 
conventional wrnkstations. The graphic quab'ty and processing 
power now available make It possible to handle lext, graph- 
ics, scanned documents and natural (photographic) images 
within standard desktop personal computers. At the same lime, 
research in mullipoint communication has provided very pow- 
erful services for communiCTting l>eiween dislributed worksta- 
tions. These twin technology developments promise new solu- 
tions to a well identified applicab'on area — multimedia visual 
teleconferencing. Teleconferencing is not the only application 
that relics on advanced media-handling capabilities— others 
include publishing and multimedia databases — but it is par- 
ticularly demanding in its use of dynamic interaction over a 
network. 

The objective in teleconferencing is to provide a shared 
space wherein users can interact as Ihey would in a face-to- 
face meeting. Visual teleconferencing is a subfield, defined as 
the sharing, to real lime, of visual material by participants. 
It should be differentiated from the term videoconferencing 
which is reserved for full-motion video support between partici 
pating sites, allowing meeting participants to sec each-other's 
nonveri>al cues and giving a sense of presence to the meeting. 
The focus in visual teleconferencing, though, is on what 
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participants talk about; for example, documents, slides and 
pkiures. We have adopted the term visual teleconferencing, 
rather than (for example) audiogniphic conferencing, because 
of the latter's emphasis on graphics rather than visuals in 
general. 

A measure of the success of a teleconferencing system 
is how well the interaction between paiticipancs simulates 
that available in a fecc-to-facc mectm^ We shall suggest in 
this paper that such interaction may even be enhanced by 
appropriately applied technology. 

II. Objectives, Requirements, and Constraints 
The task-domain objective for a visual teleconferencing 
system may be stated as follows. The system should allow 
participants in teleconferences to share visual materials as 
effectively as they can in conventional meetings. 

This objective leads to the foimulation of requirements for 
a conferencing system. We list the key requirements below. 

1) Should allow any number of participants at any number • 
of sites. 

2) Should allow all participanLs to be peeis. (As the number 
of participants increases, Ihe t^ponum'lies to interact 
will decrease and social prol€x:ols will dictate the con- 
duct of the meeting. Nevertheless, the technology should 
not impose an artiAcial hierarchy on participants.) 

3) Should allow the sharing of all visual media used 
in meetings. Essential arc paper and a blackboard/ 
whiteboard facility. Namral images and computer 
graphics and screens are also required. 

4) Should allow die interactive updating and annotation of 
all visual media, and the saving and printing of vbuals 
at any location. 

5) Should be pt>ssible both to organize and deliver formal, 
highly stnictured presentations, and to introduce visuals 
out of sequence and without warning from any site. 

The above requirements arc to be met subject to constraints. 
The following constraints arise out of the desires to make a 
system a widely applicable and as economical as possible. 

1) Should be realizable on popular desktop computers. 

2) Should allow arbitrary combinations of optional periph- 
erals for media input and output. 

3) Should be independent of Ihc communication network. 

4) Should be u.<;able after a short demonstration. The avail- 
ability of advanced features shouM not hinder novice 
users. 
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In the following sections we discau^s previous work ad- 
dressed at visual teJeconfcrencuig, then describe a system 
designed to oieet the above ojectivc and requirements. 

III. Prior Work 

IWo strands of prior work have particutar importance to the 
oiijective stated above. The Srst is conventional visual tele* 
conferencing; the second is computer supported cooperative 
work (CSCW). 

Visual teleconferencing has traditionally relied on special - 
purpose hardware terminals to provide for muldpoint distri- 
tnition of visual material. A detailed review of such systems, 
including telewriting, elecontronic blackboards, frecze-firame 
and document conferencing systems is given in [1]. In some 
casesj only computer text and graphics ace supported (ex> 
amples are the Gamma Teleskeich and the Telewriter 11 PC 
[2]). while others allow the display of freeze-frame video 
[3\ HI or scanned documents [5J, (6]. Increasingly, several 
media have been combined into a single .^system. Sometimes 
telewriting (freehand graphics) is combined with audio. [7]. 
Other alternatives are facsimile phis telewriting [8] and fireeze- 
frame video plus computer text and graphics [9]. Several 
truly multimedia systems have been reported in the literature 
[10J-[1^]- These exploit the advances in graphic displays 
mendoned al>ove, and in scmie cases combine motion video 
with the traditional media (INVITE-64 111]). Most visual 
conferencing terminals allow pointing on, and annotation of 
media, though even in advanced systems, users can often 
only do this one at a time [14]. Some systems are intended 
for U.SC over broadband networks ([12], [13J), while others 
are ISDN terminals, in all cases, they use special-purpose 
hardware, though the teraiinal is often intended to be used for 
personal computing and other communications tasks as well 
as conferencing. 

In our own earlier woik on visual teleconferencings we 
developed a model of the processes involved in interBCtive 
visual conmiunications [15]. This provided a framework within 
which to design our early experimental systems [16 J, and its 
infhicnce is seen in various aspects of MICA. A major empha- 
sis of the model was the assumption that "the participants in a 
visual conversation should have the same scene in their field 
of view at all times; thus new visual information generated 
by one participant should be communicated tnmiediately to 
the other participants/' This led to the definition of a shared 
visual ^ace^ based on Thompson's common information space 
[17], and retained in MICA, as we shall sec below. 

The se«xind strand of prior research is computer supported 
cooperative work (CSCW), a field of increasing interest today 
(see [18] and [19] for two somewhat different perspectives on 
CSCW). It differs from Uadilional visual teleconferencing in 
two major ways. First, it deals with communication tjetween 
general-purpose computer workstations that have standard 
hardware, but may differ from each other. Second, it em- 
phasizes native computer media rather than paper, vugraphs, 
or photographs. In addition, CSCW, still io its infancy, is 
dealing with a variety of human behavioral issues about 
how computers may be used to support coopcraiing (or even 
noncoo^xrraling) groups. 



Several conunercially available personal computer programs 
may be termed CSCW applications. These include group 
writing tools [20] and application sharing [21 J. The latter is 
of particular interest since it allows the simultaneous viewing 
of live (i.e., active) applications on multiple workstations. 
Thus a user of a word processor or spreadsheet (for example) 
may show work-in-progress to remote colleagues, and make 
modifications as they watch. 

MICA builds on earlier work in visual teleconferencing in 
its support of several common visual media (multimedia), 
real-time simultaneous freehand and structured annotation 
(interactive), and muliipoini working (conferencing) over long 
distances. From CSCW, it borrows the idea of using standard 
workstations rather than special-purpose hardware. By the 
same token, a variety of configurations must be supported, 
including options for document and video input. Thus, rather 
than a hardware-specific system, MICA is an hardware-flexible 
application. MICA also addresses native computer media, 
allowing the sharing of captured screens and windows, text 
files, and. with an appropriate faaidware configuration, live 
applicatioK>. 

The technical issues that arose in the development of 
MICA include ihe capture, display and compression of the 
supported media, the user interface design and the multipoint 
communications service (and ^oiocol). In each of these areas 
we have new results to report. Most are dealt with in this 
paper. Providing a multipoint service that allows simultaneous 
interaction, consistency control and effective handling of Ihe 
point-to-point case, all over long distances, is considered 
separately in our companion paper [22] A further publication 
[23J details a study into the display of natural images on 
graphics displays as used in MICA. 

rv. Overview of MICA 

A, MICA Services and Features 

MICA allows people at two or more sites to hold meetings at 
a distance and share visual material such as slides, docimients 
and photographs. Visuals arc displayed on a high or medium- 
resolution screen in an area called the shared space, which 
occupies most of the screen. Within the shared space, all 
connected sites display the same thing. Users at any site can 
introduce visuals, point and annotate on the shared space, 
and their actions arc made visible at all sites. MICA allows 
free interaction, simultaneous annotation and control of the 
conference by any participant. Access to the control functions 
is via graphical menus displayed beside the shared space. 

The inital implementation of MICA runs on an IBM POAT- 
compatlble personal computer (with cither an 80286 or an 
80386 processor), with display, communications and input 
hardware. There is no special^purpose hardware; the sys- 
tem is intended as a workstation application rather than a 
stand-alone conference tool. The user can choose either a 
medium-resolution (640 x 480 pels, VGA standard) or a high- 
resolution (1024 X 1024) display, either a roou.<;e or a tablet 
and stylus, cither a LAN, ISDN, or X.25 network interface. 
A document scanner, printer and video camera may be added 
if required. 
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of the shared space on each screen. 



Wbatever hardware configuratton a user has (and the alter* 
natives above represent only the initial implenienlations)^ that 
system can receive and display all media^ receive and display 
ail annotations^ coramunicate mulUpoint and iDterwork with 
aJI other configurations. Furthermore, all sites in a conference 
arc peers; there arc no master/slave relationships. Fig. 1 shows 
an example conference, illustrating the shared space concept 
and interworking between terminals of different hardware 
configuration. 

The shared space can be used to display the current slide (a 
multimedia visual) or the conference monitor. The meeting 
is set up using the conference monitor, which also gives 
information about remote participants and the progress of 
the meeting. Most of the time the shared space is used 
for displaying slides; pre-.stored slides for presentations, and 
documents and video images spontaneously input through the 
scanner and camera for unplanned contributions. Although 
the shared space looks identical at alj participating sites, the 
menus may look different depending on the facilities selected 
by users. They include a tray of annotators (or pens) that 
users can pick up and draw with, and a lectern from which 
visuals are presented. Figs. 2 and 3 show the screen of the 
htgh-resoluiinn version at different stages in a conference. 
Fig. 2 shows the conference monitor for a three-point meeiing, 
with photographs of partictpajits displayed (These appear in 
color, but arc reproduced here in black and white). The 
buttons close to the bottom of the screen give access to 



special services. Fig. 3 shows a visual being displayed and 
aimotated. A document has been scanned into the shared qsace 
and now a translucent highlighting pen is being used by one 
participant while the other writes with an opaqve red pen. 
The lectern is also visible; this menu gives access to physical 
devices, together with three colored folders (red. green and 
blue) that leprescnt stored sequenced slide presentations. The 
folders embody mechanisms (based on direct manipulation of 
miniamre-image icons) for accessing, browsing and selecting 
slides for display on the shared space. Fig. 3 shows two open 
folders and one closed. Two miniature images are displayed for 
each open folder. These arc content^generated icons that can 
be manipulated directly to flick through Ibc folder or to present 
slides to the shared space. The miniatures seen in Fig. 3 
represent, from left to right, visuals of the following media: 
GEM graphics, paper, a computer text fjJe, and fireeze-firame 
video, GEM graphics, captured PC screen, and paper. 

B, MICA Software Architecture 

MICA software is highly modular. This was a design 
priority so that addition of new media or new communica- 
tions interfaces, changes in the screen resolution or the user 
interface, or any other modification, would have limited effect 
on the software. This in turn was a consequence of the design 
constraint that arbitrary combinations of optional peripherals 
should be allowed. Fig. 4 shows the MICA software modules 
in their five main groups: control, display interface, commu- 
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Fig. 2. Example of MICA serviccR mode showing three sites connected. Site bars show partidpani piciuies. with the cont/ol menus dbtpIayeU ucruss Ihe botloiu. 



nications, multimedia! pnx£su»*ing and ui^er interface. The lines 
between modules show the main chanocb of data transfer. 

The control group's main function is to provide a non- 
preemptive task scheduler for MICA's six communicating 
processes. These are a user interface process, send and receive 
processes, a clock, and, when slides are being displayed, 
multimedia control and input processes. The use of multiple 
tasks allows us to provide concurrency between local events 
and remote events in a modular way. The scheduler also 
includes various functions that processes can call for synchro- 
nization purposes. The remainder of ihc first group of modules 
includes error reporting hinctions. initialization and debugging 
facilities. 

The second group of modules provide the interface lo the 
graphics display and the locater in a device independent way. 

CommunicaU'oDS software resides in a group of three mod- 
ules; MCL, send and receive. MCL (muhipoial communica- 
Uons layer) is the terminal implementation of a multipoint 
protocol intended for a wide range of applications (sec below). 
It provides all the control for multipoint consistency, includ> 
ing functions for connecting to conferences and exchanging 
data. Send and receive arc appltcation-lcvcl communication 
modules. They provide the extra control, packet izatioiu and 
message structure that is specific to MICA. 

All user aciions are interpreted by the user interface module 



which makes appropriate calls on other subsystems such as 
the conference monitor. The conference monitor includes 
data structures fm maintaining the stale of the meeting, both 
in terms of the sites that are connected (application level 
connection and disconnection), and the users at each site. It is 
accessed in a similar way by both the user interface and the 
receive modules. 

The final group of modules is the multimedia processing 
group. The slide interface module is the controller for all 
major shared space events and ensures consistency both locally 
and globally. It uses media handlers to display the various 
slide types. These modules all involve medium-specific coding 
and display techniques. They interface to the communication 
modules via functions in .slide interface that manage data 
transfer. 

V. MuLTiMKDiA Image Proce^sino for Display 

AND COMPRE.««ION 

Table 1 shows the various media supported by MICA, how 
each is compressed and displayed with algorithms appropriate 
to its type. 

Image compression is necessary lo optimize end-to-end 
throughput and storage, and while standard coding is used 
where appropriate (for structured graphics and for documents), 
new algorithms have been developed for photographic images 
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and PC screens. All media arc coded according to their input 
resolution (to example. 300 dots/in for scanned docnxncnts), 
so the traosmitted data are independent of the sender's or 
receiver's display capability. Thi.9 is essential for compati- 
bility between terminals with different display hardware, and 
also means that all media can be recovered at their original 
resolutions for printing. 

Because the workstation's available display resolution and 
range of colors does not normally match the input image's 
resohstion and range of colors, media-specific algorithms arc 
used to render the image as faithfiflly as possible. MICA 
works whh limited-palette graphics adapters that map the four 
<» eight-bit resolutioD of the graphics memory into sixteen 
or 256 colors chosen Cr<»n a large range of possible colors. 
The mapping is done by means of loadable lookup tables, 
and the set of colors used to display any particular image 
is known as the image palene. For documents, the palette 
includes several gray Icveb, used to compensate for the loss in 
spatial resolution. For photographic images, an image-adaptive 
palette selection algorithm is used. 

A. Paper 

Scanned diKummts are st ored and iran.smil!cd by MICA 
in the T.6 fonnal used in CCTTT Group IV tacsimile. This 
approach was taken for three reasons; 



1) Group IV is a proven and eHicient algorithm for trans- 
mission over reliable channels. 

2) Stored documents may be generated and accessed by 
facsimile programs running on the workstation. 

3) Wfe have an interest in network-based services that may 
include document processing. For interworklng with 
these, we chose to iLse a standard. 

The document display algorithm uses gray levels to com- 
pensate for the reduction in spatial resolution from the 2550 
X 3300 bilevel original, to the 928 or 464 pels available for 
display. The technique is akin to that for producing antialiased 
displays [24]. It invplves dividing the input document into 
cells corresponding to the number of output (screen) pels, 
and counting the ratio of black to white points in each 
cell. This value is then quantized to an appropriate gray 
level for display. Efficient implementations of the algorithm 
make use of Brcscnham's algorithm for arbitrary scaling 
between resolutions, and look-up tables for fast calculation. 
The term SARX (spatial-amplitude resolution exchange) has 
been coined for this process. 

B. Freezc'frtime Video 

The image compression algorithm exploits the redundancy 
in the input picture to reduce the number of bits which are 
required tn represent it from 24 b/pel to an average of 1.5 
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b/pet. with slight degradation in the picture quality. The 
compression scheme is based on a combination of predictive 
and Lap]adan Pyramid coding [25). The morivation is to 
combine the fast implementation of predictive coding with ihe 
progressive transmission feature of the Laplacian Pyramid. All 
processing is done on the YIQ transformed color space, with 
subsampling in the / and Q componctitv. 

The algorithm is based on recursive generation of prediction 
error. In each stcp» a highci-resolutioa picture is predicted 
by interpolation of a lower-resolution picture. The lowest- 
resolution picture in our implementation is based on 8 x J 6 
bloclcs. The four pels at the corners of blocks are iransniiU 
led first and the remainder linearly interpolated at both the 
transmitter and receiver. The prediction errors between the 
interpolated block and the original are calculated at the trans- 
mitter, quantized and transmitted with variable word-length 



codes (runs of zero-error pels arc also coded with run lengths). 

Implcmcntatton of progressive transmission is straightfor- 
ward; an entire low-resolution image is derived first by the lin- 
ear interpolation from block-comer pels. The prediction errors* 
received next, serve to add detail. Having thus sent a mono- 
chrome image, the color components are transmitted in a sim- 
ilar way. In practice, progressive iran.^mission has only been 
used in MICA systems communicating at 4.8 kb/s or below. 

The display of a natural image on a graphics adapter requires 
Ihe creation of an appropriate palette and efficient color 
.selection from the palette during display. A survey of palette 
.selection algorithms in the literature yielded tliree alternative 
approaches. In each case the firer step is to generate the color 
histogram of the image. This is a 3-D matrix whose elements 
represent Ihe RGB color space. Each cell in the histogram 
contains the number of occurrences of the corresponding color 
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in the picture. The firsi approach, the Popularity algorithm, 
simply selects the N most frequently occurring colors (for an 
yv-eotry palette) (26J, A aiodiaed version of the algorithm 
decreases the count of a selecled color's neighbors to prevent 
the concentration of palette colors io one part of the color space 
[27]. The second approach is to arrange that each color in the 
palette will represent an equal number of pels in the image 
[26]. This is the Median-cta algorithm. The third approach 
starts by finding the ^4 most frequently occorring colors 
(where M is many limes greater than AT, the number of entries 
in the palette). These represent a volume in color space. The al- 
gorithm, termed MaxMin [28], then seeks to-find N uniformly 
distributed colors on the boundary of this populated tv^gion. 
Colors within the region can then be represented by combina- 
tions of the palette colors. Using combinations means that the 
image roust either be displayed in higher resolurion than it was 
input (for example using 2x2 cells for each input pel), or 
color quantization errors must be carried from one pel to the 
next (enor diffusion). After a detailed comparison of these al- 
ternatives wc developed a modified version of the MaxMin al- 
gorithm which involves recursive improvement to the selected 
palette after the initial choice of colors. This work is reported 
in detail in [23]. In the high-resohition version of MICA, a fast 
algorithm is used to generate 2x2 palette combinations for 
each displayed pel. In the medium-resolution version* where 
the usable palette in only 10 colors^ dither and error difihision 
are used to achieve a greater subjective color range. 

C. Structured Grapfucs . 

For storage and transmission of structured graphics we 
elected to use GEM primitives [29], Our initial implementation 
of the medium has been GemDxaw based, and it was therefore 
natural ti> read and send unaltered GEM files. The graphics 
drawing routines used for display of structured graphics are 
all our own implementation. They include handling of outline 
fonts, so text is arbitrarily scalable. Decoding of other graphics 
standarcfe is relatively simple; the process consists mainly 
of table l«x>kup transformation. We have also implemented 
display of Macintosh PICT files. Since our initial workstation 
engine is a PC, these files have to be transferred prior to the 
conference. However, our current work includes implementa- 
tion of MICA on a Mac II, and this medium will then have 
greater Importance. 

D. Text Files 

Text files are transmitted uncompressed. MICA incorporates 
a feature to transfer arbitrary files, or complete directories. 
These files arc also transmitted without compression. Text is 
displayed using standard computer fonts. The need for com- 
patibility between systems while displaying lines of reasonable 
length means that the medium- resolution version uses an 8 x 
16 font for text files. This is a solution that needs refining; we 
expect to use enhanced antialiased fonts in the future. 

E. Captured PC Screens 

Text screens arc transmitted uncompressed. Graphics screens 
are compressed with a multilevel run-iengih coding scheme. 
Display of PC screens prcsenis resolution-related problem.s. 



some of which have yet to be satisfactorily resolved. Our 
screen capture program works under all modes of common 
IBM display adaptors (Monochrome, Hercules, CGA, EOA, 
VGA). However, the aspect ratio differences between certain 
modes mean that the vertical dimension sometimes has to 
be scaled when displayed. A SARX solution (sec above) to 
this would be ideal. The choice of palette for SARX requires 
investigation, since there are a large number of possible color 
boundaries on typical EGA displays. A related problem is 
that on the medium-resolution version of MICA, the screen 
is only as large as some of the captured screens. The shared 
space docs nf>t fill the screen, and (he captured version must 
therefore be scaled downwards. One possibhliiy for this is to 
subsample as necessary and hope for the best; this approach 
leads to unpleasant distortions. Another possibility is to display 
captured screens in monochrome and use SARX to give 
smooth boundaries. We arc still investigating options here, 

F. Dynamic PC Screens 

We have initially implemented only text-based application 
sharing on 80386-based machines (that is, the sender must 
have a 80386 processor, receivers need not). Compression is 
by conditional replenishment; only changed parts of the screen 
are sen! as updates. Display of dynamic screens is similar to 
display of static screens. We have chosen lo allow annoUUon 
on dynamic screens, although if a character position is updated, 
the annotation that occupied that space is erased. 



VI, The Ushr-s View of the System 
Figs. 2 and 3 show the user's view of MICA running on a 
high rcsolutaon display. They show a large shared space (80% 
of the scrven area in the high- resolution version, slightly less 
in medium-resolution), with control menus along the bottom 
and the right hand side. The menus are not complex, and 
functions are localized. For example, when visuals arc being 
displayed, presentation facilities are along the bottom menu 
and annotation facilities along the side. Some of the icons 
provide access to submenus. For example, .reelecting a scanner 
icon opens a menu that allows the user to specify whether 
documents should scanned for display in portrait, landscape 
or automatic format (automatic format selection relies on 
landscape documents being pre-marked In the top right-hand 
corner). 

Little empirical data is available for deciding how a Com- 
puter Supported Cooperative Work interface should look. 
For example, the question of tisers' access to annotators is 
discussed by Schneiderman: •'When several users collaborate 
over an elecliunic document, should they all have uidependcnl 
cursors and the freedom lo update concurrently, or should 
there be a single cursor and some protocol for 'passing 
the chalk'? Intermediate strategies such as multiple cunuirs 
for pointing, but only one for making changes arc possible. 
We could debate the merits of each approach and even 
implement them, but a scientific approach lo evahiatinn can be 
a strong stimulus to sharper thinking" (30). In the absence of 
empirical data on uscnj' models, wc have followed the rule of 
maximizing flexibility wherever possible. Thus our .solution to 
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the aoQOiaCors problcni is to aljow several pens that can be used 
for pointing or amioration simultaneously. If future behavioral 
studies show thai this flexibility leads to user confusion or 
dissatisfaction, wc will limit flexibility appropriately. With 
this proviso about the lack of empirical data, we now discuss 
three major principles of the user interface design: emulating 
the use of visual media in conventional meetings, devising 
meeting metaphors for the various MICA services, and the 
use of content-generated icons. 



A, Sharing Visual Media in Meetings 

One of MICA'S goals was to allow distributed participants 
to discuss visual Tnaterial as well or better than in conventional 
meetings. Taking the model of a semiformal meeting, we 
examined the facilities that arc provided in a conventional 
conference room, ctmsidering their flexibility, case of use and 
consistency. Wc then looked at how MICA, as a universal 
visual display device, could embody the positive features of 
conventional equipment. We were also concerned thai it be 
usable by multiple participants at a single site. In ihb section 
we review this analysis and state our conclusions: 

The principal equipment for sharing visuals in today's 
meetings is the overhead projector. It provides for display 
of hand-drawn visuals and printed vugraphs. Anything repro- 
ducible on a photocopier can be displayed, so long as the 
acetate copy is of approximately letter size. The prissenler can 
point on the visual itself or on the screen, and can annotate 
the visual m color. The size of the image can be varied by 
moving the projector, and when used in smaU meeting rooms 
the image is bright and clear (high resolution). 

TTiirty-fivc millimeter slides are used for photographic im- 
ages and for professional presentations. Expert visual design 
can make very attractive 35 mm graphics. They can be 
prepared on computer systems^ and are therefore easy to alter 
until transferred to film. Slides cannot be annotated. Although 
the size of picture can be altered, and the resolution is always 
very high, the image usually has to be viewed in a dimmed 
room. 

Blackboards, whiteboards and flipcharts allow on-the-spot 
sketching of an idea. They arc also used, in education in 
particular, for their original purpose: the progressive introduc- 
tion of preconceived material. Blackboards allow a teacher to 
introduce verbal and visual material together. 

Computer output is an increasingly important medium. 
Visual material such as alphanumeric btblcs (spreadsheets), 
statistical graphics, text, etc., are used in meetings; either in 
hard copy format, or increasingly, displayed and manipulated 
directly on the computer. Laigc-scrccn projection systems may 
be used for direct display. The image is relatively dim. and 
cannot normally be annotated. 

It is often important in meetings to show particular 3-D 
objects. A special case is when a participant is to be intro- 
duced to the audience,, though there arc many other examples, 
especiaUy in working meetings, where objects are displayed! 
A limitation of many meetings is the mismatch in size between 
the object under discussion (from a miniature mechanical part 
to a building) and ihc number of participants who must view it. 



For all the visual material mentioned above, MICA supports 
appropriate media. Ii is intended also to provide similar 
or increased facilities for interaction, superior consistency 
between the handling of different media, and enhanced control. 
A MICA user may be limited in respect of input hardware 
(for example, a scanner is required to input documents), but 
all users can display all media. 

In comparison with an overhead projector, MICA is able 
to provide equal display functionality except for varying the 
display si2x. Images are rendered very faithfully (by means of 
SARX) and they can be annotated as required. Because multi. 
pic sites are able to modify the image simultaneously, a higher 
degree of interaction is possible than with a conventional 
projector. 

Photographic images can be manipulated (annotated, etc.) 
more powerfully in MICA than in a slide projector. Although 
the resolution of the display is limited by the standard video 
input resolution, the images are adequate for most applicarions 
where photographic images arc discussed. On the other hand, 
35 mm graphics slides need to be sharp and clear In MICA 
the appropriate medium for such information is structured 
graphics. If 35 mm graphics were input via a slide scanner, 
they would suffer a pcrcepUbIc loss in quality when displayed. 
Structured graphics are guaranteed to produce the sharpest 
image possible, whatever the display resolution. 

Computer screen and applications can be displayed within 
MICA. The degradaticm that would result from lepTOduction 
on conventional presentation media is avoided. 

Display of arbitrary objects is possible in MICA within the 
constraints of the available input devices. With a medium - 
quality video camera and zoom lens, it is possible to display 
3-D obfects with resohition appropriate to the details under 
discussion. 

When used by a single person at each site, MICA allows 
easy control and interaction. This is superior to conventional 
equipment, where participants usually have to leave their seals 
to present or annotate displayed visuals. MICA does have 
the same problem when multiple participants are located at 
a single site and must share the locater device (mouse or 
tablet and stylus). It may be appropriate to develop a hardware 
interface for multiple locatcr input. When many users are at a 
single site, MICA also suffers the handicap of its fixed display 
size. We have interfaced the high-resolution system to a large- 
screen projector, which gives acceptable viewing by a hirge 
group. However, the cost of such equipment is high, and the 
image is not as bright or crisp as on a CRT. 

In conclusion, wc see that MICA provides a good distributed 
counterpart to the cq>iipment used in local meetings. It is 
superior to conventional equipment in that, so long as input 
devices are available, all media can be handled, displayed, 
annotated and stored in a consistent way. 



B, Meeting Room Metaphors 

A review of typical visual conferendng systems Jll-fl3J 
shows that user interface designs usually include easy-to-use 
menus, with selections made by clicking. However, Uic value 
of metaphor [31] has nol been exploited previously. Direct 
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TABLE II 

Features op CbnvamoNAL Meehncs aud m Corresponding Metaphoks w mica 



Meeting Feature 



Teleoonfetmce Coxrebte 



Meeting Meiapbor 



Meeting loom: "presencE** and paTtB:i}rBiil» 

Projector (slide cr ovetliead) 
Presentation VImjbIs 
Wriung oa \^u2!& 

Kmiiiig on Visuals 
FlipcbanAtofaiicfaoarU 
Inlenction 



Confcicooe monitor (lisl of panicipania) 

Shsied space display of visuals 
Stored sequence of images 
Drawli^ mode 

Movable poiider 

Blank soccn for annotoiioos 

Multiple data fttreams or batun passing 

Coofctence setup and comnnnication cotnroi 



\^tniHi oonfercoce taMe, photos and 
namctaga of pfirttcipanU 

Miniatures of slides placed on Shared 
Space hy dragging. 

Slides in ring binders; pages tnraed by dragging 
Shared pens (including faigfafigluer^ 
Uqotd paper and eraser) 

Pens arc pointers 

-FUpchan'* viewed rm the jdured i^ace 

Pens may be used simultaneonsly. 

Conttnl » hy human pnolocob 
"Services'* viewed on the slnred spux 



manipulation of famtHar meeting FOom objects (sucb as ring 
binders, pens and namelags) is used in MICA lo provide an 
elegant metaphor-based user interface, l^ible H shows how the 
visual aspects of a conventional meeting can be provided for 
electronically by a teleconference correlate, and the metaphors 
thai MICA uses lo control these features. 

We have adopted the action of dragging as the most im- 
portant mode of control, because it is both an intuitive and 
powerful metaphor for the basic manipulations to be done 
on MICA objects. For example miniatures can be dragged 
to other positions in the ring binders in alhiw browsing and 
arrangement, they can be dragged to the printer for hard copy, 
and Ihcy can be dragged to the shared space for presentation. 

C. Content-Generated Icons 

The user interface incluiles content-generated icons to rep- 
resent objects that the user can manipulate. The iconic forms 
arc generated directly from the data of the corresponding 
objects. So icons of meeting participanls are photographs of 
the appropriate person, and image-slide icons are accurate 
miniatures of the visuals they represent The size nf an icon 
shows its class or fimction. but its visible paltem shows its 
particular cnntcni. Users rely on iconic descriptions of objects 
rather than their names, because the Uxins arc unique and 
instantly recognizable. Note that the use of miniatures to 
represent larger visuals is only one way in which icons can 
be content-generated. In general, a content-generated icon is 
simply one in which the content (rather than the class or 
function) of the object is represented in iconic form. 

Miniatures are generated by the appropriate media- handling 
parts of MICA. The use of multiple grays (SARX) for docu- 
ments means that even at a resolution of 64 x 64 pels, they 
yield sufficiently faithful icons, so that visual coing is very 
fast. We have not done formal tests on the effectiveness of 
miniatures in fast browsing, but the enthusia.>aic response to 
ihem in MICA leads us to believe that other applications could 
benefit from their use. 

Consider a user interface with conventional iconic properties 
plus the ability lo scale text and image data within win- 



dows. The size of characters would be variable continuously, 
with controls associated with the window for zftcming..in 
and zooming-out. Resizing the window may result in some 
automatic ia:aJing of the contents to make best user of the 
available area. At certain points on the continuum of possible 
window sizes il will be appropriate to reduce the window's 
functionality in accordance with its prominence on the screen, 
so that in the limit the window becomes an icon with visual 
(but not functional) similarity to the large window version. 
The user may leave a window in a particular slate so thai its 
iconic form gives cues to its meaning. Thus the system allows 
many windows simultaneously displaying data, possibly in 
several different ways (e.g^ text, scanned document, graphics, 
photographic). 

In a graphical interface where files are shown with content- 
generated icons» the operation of zooming the icon out into 
a window will start up the appHcarion for processing the file. 
This has implications for the way applications are written, at 
least for a multitasking environment. Applications should not 
include functions for opening and closing documents; rather 
opening a document should cause the creation of a new dau 
area for a re-enterable application that may already be handKng 
a number of other documents. 

VII. Trials ExPEWENca Impucations and Conclusion 
MICA has been demonstrated in several forums and has 
been the object of a number of user trials in tele-education and 
ISDN. User feedback has been overwhelmingly positive, with 
the chief criticism being the lack of hard copy. This has now 
been rectified. Other comments have related to partiUoning 
and zooming of the Shared Space, and advanced user interface 
features (for more expert u.scrs). On hardware side, the issue 
of a larger display has been raised,, along with multiple input 
locators at a sing^ MICA station. 

As discussed earlier, MICA can be interfaced to a va- 
riety of communication networks. Specifically, il has been 
used over PC-net (a LAN), ISDN D (16 kb/s) and B (64 
kb/s) channels, DaUpac (X.25 packet data network), Datalink 
(circuit-switched data service), 56 kb/s leased lines and the 
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TABLE m 
CuMMUNKAIK^iS OmoNS nm MICA 


Hetworic 


Dau Rate 


Time to Tnnsmii Typical DocaiDem 
ImaSP Between 80266>B85ed IVnniDab 


Comments 


PC-Ket <LAN) 

ISDN B-channel (HDUC. X25) 
XJS over a paiAel nctworli 

X.25 over circun-switched oetwork 


2 Mb/s 

64 kb/s 

19.2 kb/« 
9.6 lib/s 

9.6 kb/8 
4 J Itb/s 
2.4 kb/s 


15 s (pmccssor limited) 

15 s (pnxxssor limited) 

15 s (processor limited) 
20 s (bandwidlb limited) 

20 s (bandwidth limited) 
40 s (bajidwidtli liinilcd) 
80 ft (bond width Irreiced) 


Limited reacb 

Fustue tiaiKtafll; am wUely avuhble 

Node delays were negligible in all tests to date 

Figures apply to dedicated data services and to 
traosmissaon over oonvcnttonsl lelephone tines. 
Noi^ Krcs lesttH in banscBBsiuo tune increases 



PSTN. Tabic III shows the trade-offs in using different com- 
fDonicaiion options. In particular, it stiows that the time 
required lo transmit a slide depends not just on the chaitncl 
data rale, but also on the complexity of the image and the 
processing power of the PC. In most cases however, the choice 
of coromunicalion option will depend on the environment in 
which MICA is being installed. 

In conclusion, we are also seeking to undefstand marlcet 
requirementN for the many kinds of computer-based telecon- 
ferencing. We are examining several vertical markets, of which 
tele-education is one, with a view to providing enhanced 
communication via MldZA technology. The fnturc success 
of multimedia conferencing products will depend on a good 
understanding, by developers, of the social, behavioral and 
economic aspects of the target user enviroomeni. 
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